Rare variant testing across methods and thresholds using the multi-kernel sequence kernel association test (MK-SKAT).

نویسندگان

  • Eugene Urrutia
  • Seunggeun Lee
  • Arnab Maity
  • Ni Zhao
  • Judong Shen
  • Yun Li
  • Michael C Wu
چکیده

Analysis of rare genetic variants has focused on region-based analysis wherein a subset of the variants within a genomic region is tested for association with a complex trait. Two important practical challenges have emerged. First, it is difficult to choose which test to use. Second, it is unclear which group of variants within a region should be tested. Both depend on the unknown true state of nature. Therefore, we develop the Multi-Kernel SKAT (MK-SKAT) which tests across a range of rare variant tests and groupings. Specifically, we demonstrate that several popular rare variant tests are special cases of the sequence kernel association test which compares pair-wise similarity in trait value to similarity in the rare variant genotypes between subjects as measured through a kernel function. Choosing a particular test is equivalent to choosing a kernel. Similarly, choosing which group of variants to test also reduces to choosing a kernel. Thus, MK-SKAT uses perturbation to test across a range of kernels. Simulations and real data analyses show that our framework controls type I error while maintaining high power across settings: MK-SKAT loses power when compared to the kernel for a particular scenario but has much greater power than poor choices.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Rare-variant association testing for sequencing data with the sequence kernel association test.

Sequencing studies are increasingly being conducted to identify rare variants associated with complex traits. The limited power of classical single-marker association analysis for rare variants poses a central challenge in such studies. We propose the sequence kernel association test (SKAT), a supervised, flexible, computationally efficient regression method to test for association between gene...

متن کامل

Optimal tests for rare variant effects in sequencing association studies.

With development of massively parallel sequencing technologies, there is a substantial need for developing powerful rare variant association tests. Common approaches include burden and non-burden tests. Burden tests assume all rare variants in the target region have effects on the phenotype in the same direction and of similar magnitude. The recently proposed sequence kernel association test (S...

متن کامل

Rare variant analysis of blood pressure phenotypes in the Genetic Analysis Workshop 18 whole genome sequencing data using sequence kernel association test

Sequence kernel association test (SKAT) has become one of the most commonly used nonburden tests for analyzing rare variants. Performance of burden tests depends on the weighting of rare and common variants when collapsing them in a genomic region. Using the systolic and diastolic blood pressure phenotypes of 142 unrelated individuals in the Genetic Analysis Workshop 18 data, we investigated wh...

متن کامل

Rare Variant Association Testing for Sequencing Data Using the Sequence Kernel Association Test (SKAT)

*These authors contributed equally to this work. 1 Department of Biostatistics, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA 2 Department of Biostatistics, Harvard School of Public Health, Boston, MA 02115, USA 3 Department of Genetics, The University of North Carolina at Chapel Hill, Chapel Hill, NC 27599, USA 4 Department of Biostatistics and Center for Statisti...

متن کامل

Power and sample size calculations for designing rare variant sequenc - ing association studies

Recently, Wu et al. [4] have proposed the sequence kernel machine test (SKAT) to test association between genetic variants in a gene or region and a continuous or binary trait. SKAT, which uses the kernel machine regression framework, is very flexible and computationally efficient. From extensive simulation studies and real data application, it has been shown that SKAT is more powerful than the...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Statistics and its interface

دوره 8 4  شماره 

صفحات  -

تاریخ انتشار 2015